Varying the number of principal components for modeling sample structure

نویسندگان

  • Hussein A. Hejase
  • Kevin J. Liu
چکیده

We examined the sensitivity of Coal-Map to the number of covariates used to model global and local sample structures. First, we represented the local sample structure using the top three covariates obtained after applying principal components analysis on the local partition Xl containing the test locus xj. Global sample structure was represented using the top two covariates after performing principal components analysis on the full alignment X excluding the local partition X`. This resulted in two models: one using the covariates W global j = (w1, w2) and the other using the covariates W glocal j = (w1, w2 . . . w5), respectively. We selected one of the aforementioned two models for each test locus xj using the heuristic approach described in the Methods section. In Figure S1, the performance of Coal-Map, using five covariates to represent sample structure (two for global and three for local), and EIGENSTRAT is shown using receiver operating characteristic (ROC) curves. Using Delong et al. test [1] with Benjamini-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling and Forecasting Iranian Inflation with Time Varying BVAR Models

This paper investigates the forecasting performance of different time-varying BVAR models for Iranian inflation. Forecast accuracy of a BVAR model with Litterman’s prior compared with a time-varying BVAR model (a version introduced by Doan et al., 1984); and a modified time-varying BVAR model, where the autoregressive coefficients are held constant and only the deterministic components are allo...

متن کامل

On convergence of sample and population Hilbertian functional principal components

In this article we consider the sequences of sample and population covariance operators for a sequence of arrays of Hilbertian random elements. Then under the assumptions that sequences of the covariance operators norm are uniformly bounded and the sequences of the principal component scores are uniformly sumable, we prove that the convergence of the sequences of covariance operators would impl...

متن کامل

بررسی ساختار جمعیتی گاوهای بومی ایران با استفاده از تحلیل افتراقی مؤلفه‌های اصلی

Effective management of genetic resources in the domestic animals is based on characterization of genetic structure and diversity among populations. Strategies reducing complexity and dimensions of data are required to analyze the genetic relationships between populations based on dense genomic data. The objective of this study was to use the discriminant analysis of principal components (DAPC)...

متن کامل

Functional Analysis of Iranian Temperature and Precipitation by Using Functional Principal Components Analysis

Extended Abstract. When data are in the form of continuous functions, they may challenge classical methods of data analysis based on arguments in finite dimensional spaces, and therefore need theoretical justification. Infinite dimensionality of spaces that data belong to, leads to major statistical methodologies and new insights for analyzing them, which is called functional data analysis (FDA...

متن کامل

Patterns Prediction of Chemotherapy Sensitivity in Cancer Cell lines Using FTIR Spectrum, Neural Network and Principal Components Analysis

    Drug resistance enables cancer cells to break away from cytotoxic effect of anticancer drugs. Identification of resistant phenotype is very important because it can lead to effective treatment plan. There is an interest in developing classifying models of resistance phenotype based on the multivariate data. We have investigated a vibrational spectroscopic approach in order to characterize a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015